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ABSTRACT 

We estimate the two-point correlation function in redshift space of the recently 
compiled HIPASS neutral hydrogen (HI) sources catalogue, which if modeled 
as a power law, £(r) = (ro/r) 7 , the best-fitting parameters for the HI selected 
galaxies are found to be r$ = 3.3 ± 0.3 Mpc with 7 = 1.38 ± 0.24. Fixing 
the slope to its universal value 7 = 1.8, we obtain ro = 3.2 ± 0.2 h^ 1 Mpc. 

Comparing the measured two point correlation function with the predic- 
tions of the concordance cosmological model (Ha — 0.74), we find that at 
the present epoch the HI selected galaxies are anti-biased with respect to the 
underlying matter fluctuation field with their bias value being &o — 0.68. Fur- 
thermore, dividing the HI galaxies into two richness subsamples we find that 
the low mass HI galaxies have a very low present bias factor (60 — 0.48), 
while the high mass HI galaxies trace the underlying matter distribution as 
the optical galaxies (60 — 1)- Using our derived present-day HI galaxy bias we 
estimate their redshift space distortion parameter, and correct accordingly the 
correlation function for peculiar motions. The resulting real-space correlation 
length is r T e = 1.8 ± 0.2 h^Mpc and r l Q c = 3.9 ± 0.6 h _1 Mpc for the low and 
high mass HI galaxies, respectively. The low-mass HI galaxies appear to have 
the lowest correlation length among all extragalactic populations studied to- 
date. In order to corroborate these results we have correlated the IRAS-PSCz 
reconstructed density field, smoothed over scales of 5 hr 1 Mpc, with the posi- 
tions of the HI galaxies, to find that indeed the HI galaxies are typically found 
in negative overdensity regions (S p / ppscz^ 0), even more so the low Hi-mass 
galaxies. 

Finally, we also study the redshift evolution of the HI galaxy linear bias 
factor and find that the Hl-galaxy population is anti-biased up to z ~ 1.3. 
While at large redshifts z ~ 3, we predict that the HI galaxies are strongly 
biased. Our bias evolution predictions are consistent with the observational 
bias results of Lyman-a galaxies. 

Keywords: galaxies: clustering - HI sources - cosmology: theory - large- 
scale structure of universe 



1 INTRODUCTION 



The clustering of the different extragalactic sources is 
an ideal tool for testing theories of structure formation 
as well as studying the large-scale structure (eg. Pee- 
bles 1993). The traditional indicator of clustering, the 
two-point correlation function, is a fundamental statis- 
tical test for the study of the extragalactic distribution 
and is relatively straightforward to measure from ob- 
servational data. Recently the new generation redshift 
surveys such as the Point Source Catalogue for Red- 
shift (PSC-z; Saunders et al. 2000), the European Large- 



Area ISO survey (ELAIS; Oliver et al. 2000), the Sloan 
Digital Sky Survey (SDSS; York et al. 2000) the 2dF 
Galaxy Redshift Survey (2dFGRS; Colless et al. 2001), 
the DEEP 2 Galaxy Redshift Survey (DEEP2; Davis et 
al. 2003) and the VIMOS-VLT Deep Survey (VVDS; Le 
Fevre, et al. 2005) have extended the clustering studies 
and now we are able to extract definitive measurements 
of galaxy clustering in the local and relatively distant 
Universe. 

Indeed Zehavi et al. (2002) and Hawkins et al. 
(2003) have found r ~ 5 /i _1 Mpc and 7 ~ 1.7 for the 
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SDSS galaxies, while Norberg et al. (2001) found for the 
2dFGRS galaxies that r ~ 4.9fr~ 1 Mpc and 7 ~ 1.7. 
Also, Coil et al. (2006) have found r ~ 4 7i _1 Mpc and 
7 ~ 1.7 for the DEEP2 galaxies at z ~ 1 and Guzzo et 
al. (2007) derived r ~ 3.6 ft _1 Mpc for the VVDS galax- 
ies at z ~ 1.5. In the infrared side, Jing, Borner & Suto 
(2002) derived r ~ 3.7ft _1 Mpc and 7 ~ 1.7, using the 
IRAS-PSCz data, while Gonzalez-Solares et al. (2004) 
using the ELAIS sources found ro ^ 4.3h~ Mpc and 
7 ~ 2 correlation functions. From a cosmological point 
of view, it is imperative to understand how the different 
types of galaxies trace the underlying mass distribution. 
It is well known that the large scale clustering pattern 
of different mass tracers (galaxies, AGN, clusters, etc) 
is characterized by a bias picture (Kaiser 1984; Bardeen 
et al. 1986). In particular, biasing is assumed to be sta- 
tistical in nature; galaxies and clusters are identified as 
high peaks of an underlying, initially Gaussian, random 
density field. Biasing of galaxies with respect to the dark 
matter distribution was also found to be an essential in- 
gredient of Cold Dark Matter (CDM) models of galaxy 
formation in order to reproduce the observed galaxy dis- 
tribution (eg. Benson et al. 2000). 

In this paper we utilize the recently completed HI 
Parkes All-Sky Survey (HIPASS; Barnes et al. 2001), 
which is the largest uniform sample of HI selected galax- 
ies in the local Universe, attempting to make a detailed 
investigation of the connection between the clustering 
and the biasing properties of HI selected galaxies. In 
particular we determine, in the framework of the con- 
cordance ACDM cosmology, the relative HI galaxy bias 
at the present time and describe the corresponding bias 
as a function of redshift. For a detailed study of the ob- 
servational clustering properties of the HIPASS sources 
we refer the reader to the recent works of Ryan- Weber 
(2006) and Meyer et al. (2007). We also refer the reader 
to the new Arecibo Legacy Fast ALFA (alfalfa) drift 
survey, especially designed to probe the faint end of the 
HI mass function in the local universe (Giovanelli et al. 
2005a; 2005b). 

The structure of the paper is as follows. In sec- 
tion 2 we discuss the HI galaxy dataset, its measured 
redshift-space correlation function, the determination of 
its present day bias factor, the corrected for redshift 
space distortion real-space correlation function as well 
as an investigation of the environment of HI galaxies. 
In section 3 we present two bias evolution models and 
study the corresponding evolution for the case of HI 
galaxies within the concordance ACDM model. Finally, 
we draw our conclusions in section 4. 



2 ESTIMATION OF THE HI SELECTED 
GALAXIES CORRELATION FUNCTION 

2.1 The HIPASS data 

The recent HIPASS catalogue (for more details see 
Meyer et al. 2004; Ryan- Weber 2006), contains 4315 
neutral hydrogen (HI) sources in the southern region 
of the survey (<$ < +2°), covering an area of ~ 2.077T on 
the sky between 300 km s _1 and 12700 km s _1 (see also 
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Figure 1. The measured (histogram) and the fitted (con- 
tinuous line) number of the HI galaxy sources as a function 
of distance. The dashed line corresponds to a volume lim- 
ited expectation. The insert panel shows the corresponding 
selection function for the Si (solid) and the S2 (hatched) 
subsamples, respectively (see section 2.2). 

Doyle et al. 2005). We exclude detections of which the 
expected completeness is very low (C < 0.5) in order 
to avoid introducing systematic effects related to the 
HI galaxy space density (estimation of the complete- 
ness limit C is given in Zwaan et al. 2004; 2005). The 
remaining sample contains ~ 4008 HI selected galax- 
ies with masses M m < 4 x W m h- 2 M Q *. In the mean 
time northern extension of the HIPASS has been pub- 
lished containing 1002 HI detections in the following 
area 2° < S < 20° (Wong et al. 2006). 

The measured radial velocities are translated to 
the Local group frame and are converted to proper 
distances using a spatially flat cosmology with Ho — 
100 fckms -1 Mpc -1 and fi m = 1 - f2 A = 0.26. 

In Fig. 1, we present the estimated (histogram) 
and that expected for a volume limited sample (dashed 
line), number of the HI sources as a function of dis- 
tance. It appears that the HI galaxy distribution is 
roughly volume-limited out to 10ft _1 Mpc, a fact corrob- 
orated also by the x 2 minimization test between model 
and observations, which gives a reduced xVdof (up to 
r < 10/i _1 Mpc) of 1.04. However, due to the fact that 
the HIPASS catalogue is a peak flux limited sample it 
suffers from the well known degradation of sampling as 
a function of distance (codified by the so called selection 
function). Thus, we can attempt to parametrize the HI 
galaxy distance distribution, for this particular effect, 
using the following formula: 

^=dfir 2 <p)0(r) (1) 
where (f>(r) is the parametrized selection function 

0(r) = df^) rQeXPh(r/rflr] (2) 

* Zwaan ct al. (2005) used H = 75h 75 kms -1 Mpc -1 . In 
that case the maximum HI mass is ~ 7.1 X 10 10 h^ 2 Mq 
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Figure 2. The space density of the HIPASS galaxy data as 
a function of distance. 



with 



A = 



Nm 



T[(a + 3)/m] r 



a + 3 



(3) 



Note that N is the total number of the HI sources, dQ is 
the solid angle of the survey, ( p ) is the mean density of 
the HIPASS sample (see below) and the factor V is the 
Gamma function. The solid line in Fig.l corresponds 
to the best-fitting dN/dr, which is determined by the 
standard \ 2 minimization procedure in which each part 
of the histogram is weighted by its Poisson error. The 
reduced xVdof is ~ 0.90 and the corresponding val- 
ues of the fitted parameters are r s = 16.5t.2 ( sf l ~ 1 Mpc, 
rn = 1.24±o.o! and a = -0.30 ± 0.12 (the uncertain- 
ties correspond to la errors). In Fig. 2 we present the 
number density of our sample as a function of distance. 
Up to ~ 10 h' 1 Mpc the corresponding space density 
is roughly constant, being (p) = 0.14 ± 0.02 h 3 Mpc -3 
and then drops because of the sample flux limit. 



2.2 HI selected galaxy redshift-space 
correlation function 

We estimate the redshift space HI galaxy correlation 
function using the estimator described by Efstathiou et 
al. (1991): 



Ur) = f 



Ndl 



1, 



(4) 



where Ndd is the number of HI pairs in the interval 
[r — Ar, r + Ar], and Ndr is the number of data-random 
pairs. In the above relation the normalization factor is 
/ = 2N R /(N D - 1), with N D and N R the total number 
of data and random points, respectively. The random 
distributions were constructed using 100 Monte Carlo 
realizations, respecting the survey area and taking into 
account all possible systematic effects of the data (eg. 
completeness, fraction of HI sources missed by the find- 
ing algorithm due to the HIPASS flux limit and the 
redshift distribution of the HI galaxy data). Note, that 
we have tested that our random catalogues reproduce 
exactly the corresponding HI selection function. 

In the left panel of Fig. 3, we present the measured 
redshift-space two point correlation function in logarith- 
mic intervals (dots). The clustering is evident, although 
the correlation function beyond r > 11 /i _1 Mpc drops 
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Figure 3. Left panel: The HIPASS galaxy spatial two-point 
correlation function (points) in redshift space. The uncertain- 
ties were estimated using 100 bootstrap re-samplings of the 
data (Mo, Jing & Borner 1992). Right panel: The HIPASS 
correlation function in real space. The dashed line represent 
the best-fitting power low f(r) = (ro/r) 7 (see parameters 
in Table 1), while the continuous line represents the best fit 
ACDM (fi A = 0.74) model with b = 0.68. 



dramatically. The dashed line corresponds to the best- 
fitting power law model £(r) = (ro/r) 7 , determined by 
the standard y 2 minimization procedure, in which each 
correlation point is weighted by the inverse of its error. 

In Fig. 4 we present the iso-A^ 2 contours (where 
A% 2 — X 2 ~ Xmin) in the ro — 7 plane (see the index 
All) . The Xmin i s the absolute minimum value of the \ 2 ■ 
In Table 1 we present the best fit parameters, derived 
for r > 1 ft -1 Mpc in order to avoid the signal from 
the smallest and therefore the highly non-linear scales 
(eg. Vasilyev et al. 2006). Note that our results remain 
robust by varying the upper r limit within the 5 to 20 
hT 1 Mpc range. 

The resulting clustering parameters T are ro = 
3.3±0.3/i _1 Mpc and 7 = 1.38±0.24. Fixing now the cor- 
relation function slope to its nominal value of 7 = 1.8, 
we obtain ro = 3.2±0.2/i _1 Mpc, but now the fit is good 
for r > 2/i _1 Mpc. Note that our results are in good 
agreement with a similar analysis of the HIPASS data 
by Ryan- Weber (2006), who find r = 3.1 ± 0.5/i _1 Mpc 
with 7 = 1.4 ± 0.5. 

It is interesting to compare our results with other 
recent galaxy correlation function determinations in the 
local Universe. In the optical, Zehavi et al. (2002) and 
Hawkins et al. (2003) find r ~ 5 /i _1 Mpc and 7 ~ 1.7 
for the SDSS galaxies, while Norberg et al. (2001) find 
r ~ 4.9/i _1 Mpc and 7 ~ 1.7 for the 2dFGRS galaxies. 
In the infrared, Jing, Borner & Suto (2002) find for the 
IRAS-PSCz data r ~ 3.7/i _1 Mpc and 7 ~ 1.7, while 
Gonzalez-Solares et al. (2004) using the ELAIS sources 
found ro — 4.3/i _1 Mpc and 7 ~ 2. It should be pointed 
out that the HIPASS galaxy correlation function has 
a significantly shallower slope than the optical and in- 
frared galaxy correlation functions. 



t The robustness of our results to the fitting procedure was 
tested using different numbers of bins (spanning from 10 to 
20) and we found no significant differences. 
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Table 1. Results of the correlation function analysis of the HIPASS sample. Errors of the fitted parameters represent 3a 
uncertainties. Finally, the ro has units of h~ x Mpc. 



Sample 


N 


ro 


7 


r (7 = 1-8) 


bo 


P 


K(f3) 


r 


r-( 7 = 1.8) 


All 


4008 


3.3 ± 0.3 


1.38±0.24 


3.2 ±0.2 


0.68 ±0.10 


0.66 


1.52 


2.4 ±0.3 


2.5 ± 0.2 


Si 


2389 


2.4 ±0.3 


1.72±0.46 


2.4 ±0.2 


0.48 ± 0.10 


0.93 


1.79 


1.8 ±0.3 


1.8 ± 0.2 


s 2 


1619 


4.9 ± 1.0 


1.40±0.42 


4.6 ±0.7 


0.94 ±0.15 


0.47 


1.36 


4.0 ±0.9 


3.9 ± 0.6 




2 4 6 

r (h _1 Mpc) 

Figure 4. Iso-Ax 2 contours in the ro — 7 parameter space 
for the whole catalogue and separately for the Si and S2 
samples. The contours correspond to la (Ax 2 = 2.30), 2a 
(Ax 2 = 6.17) and 3a (Ax 2 = 11.8) uncertainties, respec- 
tively. 



In order to investigate the possible HI mass de- 
pendence of the correlation function, we divided the 
sample into two subsamples using the following mass 
thresholds: (a) M H i < 2.475 x 10 9 h~ 2 M Q , hereafter 

51 sample containing 2389 entries, and (b) Mm > 
2.475 x 10 9 h~ 2 M Q , hereafter S 2 sample containing 1619 
entries. We apply our statistical analysis to each HI sub- 
sample by taking into account the individual redshift se- 
lection functions (see insert of Fig.l). The resulting cor- 
relation function parameters are presented in the last 
two rows of Table 1 and in Fig. 4, where we show the 
iso-A^ 2 contours for both the Si and S2 cases, sepa- 
rately. In Fig. 5, we present the estimated two point red- 
shift correlation function for the Si (solid circles) and 

52 (open squares) subsamples, respectively. The lines 
correspond to the best-fit power law model. In particu- 
lar, for the low HI mass galaxies (Si sample) we obtain 
r = 2.4 ± 0.3/i _1 Mpc and 7 = 1.72 ± 0.46 (continuous 
line). While, for the high mass HI galaxies (S2 case) we 
find a larger correlation length, ro = 4.9 ± 1.0/i _1 Mpc 
and 7 = 1.40 ± 0.42 (see dashed line in Fig. 5). It is 
clear that the correlation length increases with richness, 
as expected from its well known richness dependence of 
the correlation function (eg. Bahcall & Burgett 1986). 



Note, that in section (2.4) we further investigate this 
issue in more detail. 



2.3 The HI selected galaxy real-space 
correlation function 

The previous results are hampered by the fact that the 
analysis has been performed in redshift space and there- 
fore the derived correlation functions are amplified in 
the linear regime by the factor K(/3) (Hamilton 1992, 
also see Hawkins et al. 2003) given by: 

m) = 1 + f + y - (5) 

where /3 ~ Q^'/bo, and b is the present day bias factor 
of the HI selected galaxies. Determining this bias factor 
and using f2 m = 0.26, valid for the concordance cos- 
mological model, we can evaluate the K(f3) factor and 
recover the real-space HI galaxy correlation function. 

To this end, we compare the theoretically expected 
mass-tracer correlation function, in the concordance 
CDM cosmology, with our derived correlation function. 
It is well known (eg. Kaiser 1984; Benson et al. 2000), 
that according to the linear biasing ansatz, the corre- 
lation functions of a mass-tracer (£ bj) and the dark- 
matter ((dm), are related by: 

Cobj(r) = boCoM(r) . (6) 

We quantify the clustering of the underlying mass dis- 
tribution using the spatial correlation function of the 
mass, ^DM(r), which is the Fourier transform of the spa- 
tial power spectrum P(k): 

ZoM(r) = ^J\ 2 P(k)^l Ak , (7) 

where k is the comoving wavenumber in units of h 
Mpc -1 . As for the power spectrum, we consider that 
of CDM models: P(k) ~ k"T 2 (k), with scale- invariant 
(n = 1) primeval inflationary fluctuations. We utilize 
the transfer function parameterization as in Bardeen et 
al. (1986), with the approximate corrections given by 
Sugiyama's (1995) formula: 

r(fc) = Ml+^ [1+3 . 89g + ( 16.1 9 f + 

(5.46g) 3 + (6.71g) 4 n 1/4 . 

with 



n m h 2 ox P [-n b - (2h) 1 / 2 n b /n m \ v ; 
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where fit is the baryon density. Note that we also use the 
non-linear corrections introduced by Peacock & Dodds 
(1994). 

In the present analysis we consider the concordance 
model (f2 m = 0.26) with cosmological parameters, that 
fit the majority of observations, f2 m + Qa = 1, Ho = 
100/ikm s^ 1 Mpc -1 with h ~ 0.72 (cf. Freedman et 
al. 2001; Peebles and Ratra 2003; Spergel et al. 2003; 
Tonry et al. 2003; Riess et al. 2004; Tegmark et al. 2004; 
Basilakos & Plionis 2005; 2006; Spergel et al. 2006 and 
references therein), baryonic density parameter Qbh 2 — 
0.02 (e.g. Olive, Steigman & Walker 2000; Kirkman et al 
2003). Note that the concordance model is normalized 
to have fluctuation amplitude, in spheres of 8 h~ Mpc 
radius, of as — 0.74 in agreement with both the recent 
WMAP 3-years results (Spergel et al. 2006) and the 
clustering of the XMM X-ray sources at relatively high 
redshifts (Basilakos & Plionis 2006). 

In order to derive the HIPASS bias at the present 
time we perform a standard \ 2 minimization procedure 
between the HIPASS galaxy correlation function in real 
space, £ z (r)/K(/3), and that expected in the concor- 
dance cosmological model: 



2 (M = E 



a(r)/K(P)-a bj (r,bo) 



(9) 



where a 1 is the observed correlation function uncertain- 
ties. The outcome of this analysis provides the HI galaxy 
bias factor at the present time: bo = 0.68 ± 0.10. Thus 
evidently the HI selected galaxies, in the local Universe, 
are anti-biased with respect to the underlying matter 
distribution, implying that they generally trace low- 
density regions. In the right panel of Fig. 3, we plot 
the £(r) in real space of the HIPASS sample and the 
theoretically expected one (continuous line) for the con- 
cordance model using bo = 0.68. It is obvious that they 
compare extremely well. 

Performing now the same analysis for the two sub- 
samples, we find for the Si subsample (low HI masses) 
a value of: bo — 0.48, which again means that from the 
statistical point of view the low HI mass galaxies trace 
low density regions in the local universe. However, for 
the S*2 case we get bo — 0.94, in agreement with that 
found using optical galaxies (eg. Verde et al. 2002; La- 
hav et al. 2002). 

In Table 1 we list our derived values of the HI bias 
factor, bo, at the present time, as well as the redshift 
distortion (3 parameter and a measure of the K((3) cor- 
rection. Multiplying with 1/K(f3) each bin of our red- 
shift space correlation function £ z (r) and repeating our 
fitting procedure we derive an estimate of the real-space 
correlation length which is: rir = 2.4 ± 0.3 ft" 1 Mpc for 
7 = 1.38 and r r c = 2.5 ± 0.2 7i _1 Mpc for 7 = 1.8. 

It is interesting to mention that Ryan- Weber (2006) 
and Meyer et al. (2007) found a somewhat larger cor- 
relation length in real space, rj = 3.45 ± 0.25/i _1 Mpc 
with 7 = 1.47 ± 0.08 and r, r , c = 3.5 ± 0.7/i _1 Mpc with 
7 = 1.9 ± 0.3, respectively. This difference should prob- 
ably be attributed to the intrinsic uncertainty of the 
different methods used to correct the measured galaxy 
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Figure 5. The spatial two-point correlation function in red- 
shift space for the Si (solid points) and S2 (open squares) 
subsamples. The error bars are estimated using the bootstrap 
procedure. The lines represent the best-fit power low model, 
£(r) = (ro/r) 7 (sec parameters in Table 1). 



two-point correlation function for redshift-space distor- 
tions. 

Note that for the £1 and S2 subsamples we find 
r T c = 1.8 ± O^/i^Mpc and r ( r , c = 3.9 ± 0.6ft _1 Mpc for 
7 = 1.8. Finally, we would like to stress that the low- 
mass HI galaxies appear to have the lowest correlation 
length among all extragalactic populations studied to- 
date. 



2.4 The Environment of HI galaxies 

In order to investigate the cause for the relatively low- 
amplitude of the HI galaxy correlation function, we de- 
cided to correlate the positions of the HI galaxies with 
the reconstructed smoothed density field of the PSCz 
flux limited (0.6 Jy) IRAS galaxy catalogue. We use the 
reconstructed density field of Branchini et al. (1999), 
smoothed as in Plionis & Basilakos (2001) utilizing a 
Gaussian kernel on a 3D grid with grid-cell size and a 
smoothing scale (R Bln ) equal to 5 ft -1 Mpc. We also cor- 
rect for effects related to the unavoidable convolution of 
the constant smoothing window with the degradation of 
the redshift selection function (see Plionis & Basilakos 
2001 for details). We then cross-correlate the positions 
of each HI galaxy with the PSCz smoothed density field 
and assign to each the overdensity of its nearest grid 
cell. In this way we create the 1-point PSCz overden- 
sity probability distribution which is related to the HI 
galaxy positions and compare it with the corresponding 
full PSCz 1-point distribution function. Note that we 
exclude from our analysis the inner volume (r < 15 ft -1 
Mpc) in order to avoid discreteness effects, ie., associat- 
ing a large number of HIPASS galaxies with only a few 
PSCz density grid-cells (16% of HI galaxies with 0.33% 
of the total grid-cells) . 

In the left panel of Fig. 6 we present the two 
frequency distributions, for which the Kolmogorov- 
Smirnov two-sided test shows that the probability of 
them having the same parent distribution is < 10~ 10 . 
In the right panel of the same figure we show the ra- 
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Figure 6. Left Panel: The 1-point PSCz overdensity dis- 
tribution function related to the positions of the HI galax- 
ies (solid line) and the corresponding full PSCz distribution 
(dashed line). Right Panel: The ratio of the two distributions. 



tio of the two distributions as a function of Sp/p. If 
the HI galaxies would trace the same structures as the 
IRAS-PSCz galaxies then the two distribution should 
be indistinguishable and the ratio would be ~ 1. The 
results presented in this panel are very informative re- 
garding the local environment of HI galaxies. We see 
that the HI galaxies avoid high density regions, as well 
as the very low density regions. They prefer, with re- 
spect to IRAS galaxies, regions of quite low galaxy den- 
sity (-0.4£ Sp/ppsc,^ 0). 

In order to verify the other outcome of our HI 
galaxy correlation function analysis, ie., the fact that 
there appears to be a dichotomy between the clustering 
properties of the low and high mass subsamples of HI 
galaxies, we decided to compare the PSCz density dis- 
tribution corresponding to the positions of the high and 
low HI mass galaxies. In Fig. 7 we show this ratio and it 
is evident that with respect to the high mass HI galaxies, 
the low mass ones populate typically only under-dense 
void-like regions (Sp/ ppscz< 0). We find that 75% of 
low-mass HI galaxies are associated with negative PSCz 
overdensities, with 40% having Sp/ppscz < —0.3. This 
fact corroborates our previous clustering analysis and 
the resulting extremely low-bias parameter of these HI 
galaxies (bo — 0.48). It should also be kept in mind 
that with respect to the optical, infrared galaxies typ- 
ically avoid the high-density regions, like those related 
to clusters of galaxies, a fact which translates to a PSCz 
correlation length of ro — 3.7h~ Mpc (Jing et al. 2002), 
while the PSCz bias parameter, within the concordance 
cosmological model (fi m = 0.26), has been found to be 
6pscz ^ 0.9 (eg., Basilakos & Plionis 2006). 

Furthermore, these results are very interesting in 
themselves because they indicate that low-density void- 
like regions could well be filled by such HI low-mass 
galaxies, although it also appears that the lowest density 
regions are avoided also by such galaxies. We believe 
that this issue will be settled in the next few years with 
the new alfalfa survey, which is expected to observe 
about 16000 HI galaxies, most of them of low mass, in 
7000 square degrees (Giovanelli et al. 2005a; 2005b). 
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Figure 7. The ratio of the 1-point PSCz overdensity distri- 
bution functions of the low and high HI mass galaxies. 



3 HI GALAXIES BIAS EVOLUTION 

In order to understand better the importance of the 
measured HI galaxy clustering, we investigate the evo- 
lution of the HI bias factor as a function of redshift. The 
evolution of neutral hydrogen as a function of redshift 
is a powerful cosmological tool because it indicates the 
rate of transformation of gas into stars and thus the gas 
consumption and the evolution of star formation in the 
Universe. 

Over the past two decades, based on different as- 
sumptions, a number of bias evolution models have been 
proposed (eg. Nusser & Davis 1994; Fry 1996; Mo k, 
White 1996; Matarrese et al. 1997; Tegmark & Peebles 
1998; Basilakos & Plionis 2001). Here we will use only 
models that have been shown to describe relatively well 
the evolution of bias even beyond z ~ 1. These are: 

• Merging Bias Model (hereafter Bl): Mo & White 
(1996) have developed a model for the evolution of the 
the so-called correlation bias, using the Press-Schechter 
formalism. Utilizing a similar formalism, Matarrese et 
al. (1997) extended the Mo & White (1996) results to 
include the effects of different mass scales (see also Bagla 
1998). In this case the expression which describes the 
bias evolution is 



& B rO) = 0.41 + 



(bo - 0.41) 

DP{z) 



(10) 



with p ~ 1.8. Note that D(z) is the linear growth rate 

of clustering (cf. Peebles 1993) * scaled to unity at the 
present time. 



D{z) 

with 

E(z) 



5n m E(z) 



[fim(l - 



°°(1 + *) 



dx 



(11) 



(12) 



■f D(z) = (1 + z) 1 for an Einstein-de Sitter Universe. 
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• Bias from Linear Perturbation Theory (hereafter 
B2): Basilakos & Plionis (2001, 2003), using linear per- 
turbation theory and the Friedmann-Lemaitre solutions 
of the cosmological field equations have derived analyt- 
ically the functional form for the evolution of the lin- 
ear bias factor, 6, between the background matter and 
a mass-tracer fluctuation field. The main assumptions 
of this model is that the DM and galaxies share the 
same velocity field and that the mass tracer population 
is conserved in time. For the case of a spatially flat A 
cosmological model (fi m + Qa = 1), the bias evolution 
can be written as: 



& B2 (2) = CiE(z) + C 2 E(z)I(z) + 1 



with 



I(z) = 



y 3 dy 



(13) 



(14) 



Note that this model gives a family of bias curves, due 
to the fact that it has two unknown parameters, (the 
integration constants C\,C-i). Basilakos & Plionis (2001, 
2003) compared the B2 bias evolution model with other 
models as well as with the HDF (Hubble Deep Field) 
biasing results (Arnouts et al. 2002), and found a good 
consistency. Of course in order to obtain partial solu- 
tions for b(z) we need to estimate the values of the con- 
stants Ci and C2, which means that we need to calibrate 
the b(z) relation using two different epochs: &(0) = b 
and b(zi) = b%. 

Therefore, utilized the general bias solution (see 
eq.13), it is routine to obtain the expressions for the 
above constants as a function of bo and 61: 



Ci = 



(bo - l)E( Zl )I( Zl ) - (h - 1)£(0)J(0) 
E(Q)E(z 1 )[I(z 1 )-I(0)] 

E(0)(b! - 1) - E(Z!)(bo - 1) 

E(Q)E(z 1 )[I(z 1 )-I(0)] ' 



(15) 
(16) 



where for the present epoch we have: 60 — 0.68, E(0) = 
1 and 1(0) ~ 11.54. 



3.1 The HI bias factor in the distant Universe 

The hyperfine transition responsible for the HI emis- 
sion is very weak and therefore with current telescopes, 
studies of the emission at 21-cm are limited to the very 
local Universe (z < 0.2). This implies that the estima- 
tion of the bias parameter 61 of HI sources in the dis- 
tant universe (z > 1.5) would not be an easy task. At 
such redshifts, our knowledge on the neutral gas comes 
mostly from the absorption lines detected in the spectra 
of background quasars. 

At redshifts 1.5 < z < 5.0 most of the cosmic HI 
mass is contained in Damped Lya systems (DLAs, e.g. 
Wolfe et al. 1986, Lanzetta, Wolfe & Turnshek 1995). 
The DLAs are quasar absorption line systems with HI 
column density Nhi > 2 x 10 20 atoms cm -2 (see Peroux 
et al. 2001; Wolfe et al. 2005) and they are composed of 
predominantly neutral gas. Wolfe (1986) established the 
given column density threshold so that it corresponds to 
the surface density limit of the local 21-cm observations 




Figure 8. The evolution of the HI galaxy bias factor for the 
Bl (continuous line) and B2 (dashed line) bias models, re- 
spectively. The solid circle represents the observational bias 
for the Damped Lyman-a systems (DLA) of Cooke et al. 
(2006). The square-like symbol corresponds to Lyman break 
galaxies (LBG, Steidel et al. 1998) while the open circle rep- 
resents the bias for the Lyman-a emitters (LAE) derived by 
Ouchi et al. (2005) and Kovac et al. (in preparation). Note 
that the dot-dashed line represents the bias behavior for the 
LAE galaxies. 



at that time, which roughly corresponded to the transi- 
tion from primarily ionized gas to predominantly neu- 
tral gas (e.g., Viegas 1995, Prochaska 1999, Vladilo et al. 
2001). Therefore, these systems play a vital role in the 
structure formation because the large column densities 
can protect large amounts of neutral gas from the ion- 
izing background (Zwaan & Prochaska 2006) and thus 
produce a favorable environment for star formation. To 
this end it is interesting to mention that Zwaan et al. 
(2005b) have found that in the redshift range of z ~ 5 to 
z = the vast majority of the mass density in HI sources 
is locked up in column densities of Nm > 2 x 10 20 atom 
cm 2 . 

We will be using the above ideas to calibrate the B2- 
model, based on the Cooke et al. (2006) value of the 
bias for Damped Lyman-a systems which is b(3) ~ 2.4 

(see solid point in Fig. 8 ) §. 



3.2 The global HI bias results 

Knowning the HI galaxy bias factor at the present time, 
we can use eqs.(15) and (16) and the observed DLA bias 
factor at z ~ 3 to plot the B2 bias evolution model for 
the concordance cosmological model (see Fig. 8). We also 
plot the Bl evolution model, which is uniquely deter- 
mined from the present day HI galaxy bias factor (Bl 
continuous line and B2 dashed line). Evidently, up to 
z < 1.3 we have an anti-bias picture but due to the 
fact that the bias is a monotonically increasing function 
of redshift, for both Bl and B2 biasing models, the HI 
sources at higher redshifts are biased with respect to 
the underlying matter distribution. Note also that both 



3 For 2 = 3 and f2 n 
and 1(3) ~ 7.47 



1 - n A = 0.26 we get E(3) ~ 4.17 
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models provide almost the same bias evolution behavior, 
while for zj> 5 the two models diverge. 

We further compare our analytic solutions with ob- 
servations of: (a) Lyman-break galaxies (hereafter LBG; 
see solid square in Fig. 8) which are strongly biased with 
respect to the underlying mass and have 6(3.4) ~ 4 
(Steidel et al. 1998; Adelberger et al. 1998; Kashikawa et 
al. 2006), and (b) Lyman-a emitters (hereafter LAE; see 
open circles in Fig.8) with 6(5.7) = 3.8 ± 1.8 (Ouchi et 
al. 2005) and 6(4.5) ~ 2.6 (Kovac et al. in preparation). 
For the LAE case we present in Fig.8 the B2 bias evo- 
lution model, anchored at z = 4.5 (dot dashed line)T 
Within this bias evolution model the LAE galaxies ap- 
pear anti-biased up to z ~ 2.1, while beyond a redshift 
of z ~ 3 they become strongly biased with respect to 
the underlying mass fluctuations. 

The strong biasing predictions, in agreement with 
those found from simulations of galaxy formation (eg. 
Kauffmann et al. 1999), imply that Lyman-a systems, 
at high redshifts, are formed at the highest peaks of 
matter density field (eg. Mo & White 2002). The last few 
years many authors using Lyman-a systems have found 
high overdensities (8 > 50) at high redshifts. Indeed, 
Steidel et al. (1998) using LBGs found a protocluster 
at z ~ 3.1. Recently, Venemans et al. (2004), Miley et 
al. (2004), Ouchi et al. (2005) and Interna et al. (2006) 
found protoclusters of LAE galaxies at high redshifts 
(z ~ 4). Also a cross correlation analysis has shown 
(Cooke et al. 2006) that the LBGs are associated with 
the DLA systems at the same redshift, a fact which 
implies that perhaps the LBGs trace the same parent 
system as the DLAs (Cooke et al. 2005). 



4 CONCLUSIONS 

We have studied the clustering properties of the HI 21- 
cm emission line sources from HIPASS catalogue in red- 
shift space. Modeling two point correlation function as 
a power law, £(r) = (r /r) 7 , we find r = 3.3 ± 0.3/i -1 
Mpc and 7 = 1.38 ±0.24. Fixing the slope to its univer- 
sal value 7 = 1.8, we obtain r = 3.2 ± 0.2/i _1 Mpc. 

Comparing the measured spatial correlation func- 
tion for the HI selected galaxies with the theoretical 
predictions of the preferred ACDM cosmological model 
(fi m = 1 — = 0.26) and a bias evolution model, 
we find that the present bias value is 60 ~ 0.68, sug- 
gesting that the HI selected galaxies are located in rela- 
tively underdense regions in the local Universe, in agree- 
ment with previous studies (Grogin & Geller 1998). 
Using the derived value of the bias we determine the 
redshift-space distortion parameter, K(f3), and use it 
to derive the real-space correlation function, for which 
we have: r T c = 2.4 ± 0.3/i _1 Mpc with 7 = 1.38 and 
r T c = 2.5 ± 0.2/i _1 Mpc for 7 = 1.8. 

Dividing our HI galaxy sample into two richness 
subsamples we find (a) for M H i < 2.475 x W 9 h~ 2 M Q 
that r r c ~ 1.8/T 1 Mpc (for 7 = 1.8) and b ~ 0.48 while 
(b) for Mm > 2.475 x 10 9 /i" 2 M Q that r ( r , c ~ 3.9/r 1 Mpc 

1 For 2 = 4.5 we get £(4.5) ~ 6.64 and 7(4.5) ~ 6.41 



(for 7 = 1.8) and 60 — 0.94. Therefore, the low mass HI 
galaxies should trace low densities, due to very low bias- 
ing found while the high mass HI galaxies should trace 
average galaxy densities as optical galaxies do (60 — 1). 
Indeed, we corroborate these results by correlating the 
HI galaxy positions with the IRAS-PSCz density field, 
smoothed over scales of 5 ft -1 Mpc, to find that indeed 
the HI galaxies are typically found in negative overden- 
sity regions (<5p/ppscz< 0), even more so the low-mass 
HI galaxies. We conclude that such galaxies could be 
the typical population of galaxies in void-like regions. 

Finally, we investigate the redshift evolution of the 
HI galaxy linear bias factor and we find that the anti- 
bias behavior extends up to z < 1.3. 
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